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A MODEL FOR STRATEGIC VOTING* 


DALE T. HOFFMANt 


Abstract. A model combining an individual voter’s personal preferences (a cardinal utility rating) with 
the preferences of the rest of the electorate (as measured by pre-election polls) is investigated to determine 
how a voter may maximize his expected satisfaction with the election outcome. The basic model of McKelvey 
and Ordeshook for a one man-one vote election procedure is generalized to one man-N votes, approval 
voting and committee elections. Explicit decision conditions are derived for each procedure. An approxima- 
tion technique is described and some numerical results are included. The final section extends the basic 
deterministic model to a situation in which the optimal voting strategy may be probabilistic. The model 
can be used to compare voting procedures as well as the strategies available to a voter under a particular 
voting procedure. 


1. Introduction. The usual ‘“‘one man-one vote” election system has serious 
drawbacks when the number of candidates exceeds two. A voter who always votes 
for his favorite candidate may forfeit much of his electoral power if his favorite 
candidate has little chance of winning. The voter who is willing to vote for a candidate 
other than his favorite still has the problem of deciding for whom to vote. 

Several investigators have recently examined and compared various election 
systems for multicandidate elections. Brams [4], Fishburn [10], Lucas [15], Merrill 
[18] and Weber [22] concentrated on examining the properties of the systems inherent 
in the rules for the system and showed that one or another voting system is ‘‘better’’ 
in certain respects. We take a different approach and concentrate on the individual 
voter rather than the system. 

In this paper we investigate how the individual voter may maximize his power 
by strategically using his vote(s) in the optimal manner allowed by the election system 
used. By considering both the individual voter’s personal preferences (a cardinal utility 
rating of the candidates) as well as the preferences of the rest of the electorate (as 
measured by pre-election polls), the model enables the voter to determine for whom 
to vote in order to maximize his expected satisfaction (utility) with the election 
outcome. The general model will facilitate the further comparison of election systems 
by providing a unified framework so that the optimal voting behavior of voters can 
be compared. It can also be of use to the individual voter in a multicandidate election 
and some sample calculations are included. 

A utility model for 2- and 3-candidate elections was first developed by McKelvey 
and Ordeshook [16] for elections in which each voter had one vote. We extend the 
model to elections with possibly more than one winner (committees) and in which 
each voter may have more than one vote, including approval voting elections. Methods 
for approximating the necessary numerical conditions for the model are suggested 
and numerical results are given for some elections. The calculated optimal voting 
behavior appears to agree well with the intuitive optimal behavior of voters. 


2. Assumptions and notation. Throughout this analysis the term ‘‘the voter” will 
refer to that individual voter whose optimal course of voting action will be determined. 
Two basic pieces of information about the voter and the rest of the electorate will be 
needed: (1) It is assumed that the voter has a cardinal personal preference scale on 
which he rates his expected satisfaction with each of the possible election outcomes; 
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and (2) It is assumed that some information about the preferences of the rest of the 
voting population is known. The term “‘the voter’s rating of candidate A”’ will mean 
the voter’s expected satisfaction with the election result if candidate A is elected. The 
information about the preferences of the rest of the electorate would most commonly 
come from pre-election polls, but could be the result of educated guesses. It is supposed 
that the electorate’s preferences are known only imperfectly, and an estimate of the 
accuracy of this information is used in the analysis. 

The two pieces of information used in the model are the ones which create conflict 
for the concerned voter. In this analysis the concerned voter will attempt to resolve 
the possible conflict between his preferences and those of the rest of the electorate 
by voting in the way which is most likely to increase his satisfaction with the election 
outcome. 

Let N be the number of candidates in the election, and let C; = the ith candidate, 
R; =the voter’s personal rating of C;, W; =the probability that C; wins the election 
and T;; = the probability that C; and C; tie for first (i 4 j). The values of R; and T;; will 
determine the voter’s course of action in the voting systems analyzed. The values of 
the T;; are crucial to this analysis, and the model includes an explicit means of calculating 
them. Since the probabilities of three-way ties for first place are so much smaller than 
those for two-way ties, they will have negligible influence on the voter’s decisions and 
are not used here. It is supposed in elections without runoffs that if two candidates 
are tied for first, the winner will be decided in a fair manner so each of the tied 
candidates is equally likely to win. 

If C; wins the election, the voter obtains an expected satisfaction (utility) of R;, 
and if C; and C; tie for first, the voter will obtain an expected satisfaction of (R; + R,)/2. 
Thus, the overall expected satisfaction for the voter is 
(1) EWR +S Ty 


L*] 


Rather than recalculate the total expected satisfaction for the voter for every possible 
vote, the model will maximize the voter’s expected gain in satisfaction. If (1) is the 
voter’s expected satisfaction before the voter votes and the voter casts 1 vote for Ci, 
then a gain is achieved in two possible ways: (i) C; had been losing but is now tied 
for first with C;, or (ii) C; had been tied for first place with C; but now C, wins. If it 
is assumed that the probability of losing by one vote is the same as the probability 
of a tie (losing by 0 votes), then the expected gain in each case is T,;(R:—R,)/2. 
Hence, a vote for C; gives the voter a total expected gain in satisfaction of T,2(Ri— 
R2)+ T13(Ri—R3)+++++Tin(Ri- Ry). 


3. The geometry of an election. Calculating 7;,;. In a 3 candidate election, the 
set of election outcomes is {(v1, v2, v3): v; = number of votes cast for C; and v1 + 02+ v3 = 
N, the total number of votes cast by the electorate}. Each of the possible outcomes 
can be represented as a 2-dimensional outcome triangle by converting to barycentric 
coordinates (Fig. 1). A;; is that portion of the outcome triangle in which C; loses to 
C; by one vote or less; it is the region within 1 unit (vote) of the line v; =v, >v, on 
the C; side of the line. Tj; is the probability that the election result (v1, v2, v3) lies in 
the region Aj. 

McKelvey and Ordeshook [16] only applied their model to 2- and 3-candidate 
elections in which each voter had 1 vote, and they limited their numerical calculations 
to a uniform probability distribution. We shall extend their work in several directions: 
M candidate elections, elections of committees, elections in which a voter has N votes 
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Fic. 1. The geometry of a 3-candidate election. 


and ‘‘approval voting”’ elections. Numerical results for a Gaussian distribution will be 
given. 

Let E; be the percentage of the vote predicted for C; by the pre-election polls. 
We construct a Gaussian distribution F on the outcome triangle by F'((v4, v2, v3)) = 
K - exp (—(1/2)D7/S*), where D is the distance from (E1, E2, E3) to (v1, v2, v3)/N 
and S* is a measure of the uncertainty in the prediction. A Dirichlet distribution may 
be more appropriate because of the nonzero correlations between the v;, but we use 
the Gaussian for simplicity. The necessary computation is the evaluation of the integral 


Ty= | J Flo v2, 03). 


These volume calculations can be greatly simplified by observing that each region A; 
is very narrow, Only one vote wide. Thus, 7;; will be (almost) proportional to the area 
of the face of 7;,, and we will only need the ratios of 7;;’s. These areas can then be 
numerically approximated using Simpson’s rule or Romberg integration. 

For a 4-candidate election, the outcome space will be a solid tetrahedron. The 
areas of election uncertainty A;,; will be certain internal faces of the tetrahedron, and 
the 7;; can be approximated by approximate integration along a series of lines on 
each face. As the number of candidates increases, the geometry becomes more 
complicated, but the 7;; values can still be approximated. The model can also be used 
for elections in which winning is not the only goal. In some primaries it is important 
for psychological and financial reasons that a candidate not place too low. If the voter 
can assign utility values to nonfirst place outcomes, then the probabilities can be 
approximated and used to find an optimal voting strategy. 

In the following analysis of particular voting systems, we will derive critical values 
A”* such that the voter should vote in one way if the rating of a candidate is greater 
than A”* and another way if the rating is less than A*. These A* will depend on the 
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R;, the T;; and the voting system being considered. For a 3-candidate, one man-one 


vote election, 
ge RiQTi2t Tis) + Rs Tas Tis) 


and the voter should vote for C, if Ro<A* and for C, if R2>A”. This type of 
weighted mean of the ratings of the candidates occurs for all of the voting systems 
examined. Figure 2 shows the critical values for a 3-candidate election with S* = .05, 
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FIG. 2. Critical values for the 3-candidate one man-one vote election (S* = .05). 


the ratings of the candidates normalized so 100 = R; 2 R2= R;3=0 and the T;; values 
approximated using Simpson’s rule over 20 equally spaced points. If the pre-election 
poll predicts an outcome (Fi, F2, F3), then the voter should vote for C2 if and only 
if the point (E>, R2) lies above the curve corresponding to F,. Thus, if (£1, E2, E3)= 
(.25, .35, .40), then the voter should vote for C, if and only if R.>84. Figure 3 
contains the same information for S” = .01. If (Ei, E>, E53) is again (.25, .35, .40), then 
the voter should vote for C2 if and only if R2>53. As it becomes more certain (as 
Ss? gets smaller) that the favorite C; will not win, the voter should be more likely to 
vote for C2, i.c., A* gets smaller. Even if one can not accept the precise nature of 
the quantitative results, the qualitative suggestions concerning whom to vote for should 
still be of value to a concerned voter. 


4. Other voting procedures. The one man-one vote situation described is the 
most common multicandidate procedure and appears frequently in primary elections. 
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FIG. 3. Critical values for the 3-candidate one man-one vote election (S* = .01). 


The remainder of this analysis will determine optimal voting strategies for elections 
involving multiple votes per voter, including approval voting and multiple winner 
elections (committees). 

Occasionally an election is held in which each voter has N >1 votes. Generally 
this occurs for committee elections, but we first examine it for a single winner election. 
We will consider the cases where the voter may (i) cast all of his votes for a single 
candidate and (ii) cast only 1 vote for each candidate. Assume throughout that 
R,2R22:::2 Rm, where M 1s the number of candidates. We also assume that the 
probability that C; loses to C; by 1 vote is approximately the same as the probability 
that C; loses to C; by exactly n votes, 1 Sn SN. As long as the total votes cast by the 
electorate is large compared to the N votes available to a single voter this assumption 
is reasonable, and it greatly simplifies the calculations. 

In the unrestricted case, the voter is allowed to distribute his N votes among the 
M candidates as he sees fit. Let G(u1, v2, +++, Uy) be the expected gain in utility from 
casting v; votes for C, )v;=N. By selecting p;=v,/N, p =(p1, p2,'**, Dm) is an 
M -dimensional probability vector, and the set P of all such probability vectors is a 
convex compact set. 

M-1 M 


G(v1,°+*, Um) = G(piN, +++, paN) = X Xu (pi—pj)* N+ Ty + (Ri Rj) 
t= 


is a linear function of the variables p; on the set P, so G attains its maximum at an 
extreme point of P. But these extreme points are the unit basis vectors of R™, and 
each corresponds to casting all N votes for a single candidate. Thus, it is never uniquely 
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optimal for a voter to split his N votes among two or more candidates. In the case 
of 3 candidates (M = 3), the explicit decision criteria is 
Bt= R,(2T12+ T13) + R3(T23— M13) 


If R2>B*, then the optimal strategy is to cast all N votes for C). If R2<B”, then 
the optimal strategy is to cast all N votes for C;. B* is independent of the number 
of votes N which the voter has as long as the original assumption holds. 

Occasionally a voter has N > 1 votes but is not allowed to cast more than K <N 
of them for a single candidate. This imposes the extra constraint on the p; in the 
previous paragraph that each p; = K/N, but the set of such probability vectors is still 
compact and convex, so the previous argument again shows that the voter should 
‘“lump”’ his votes as much as is allowed by the election rules. In this situation, it may 
be optimal for the voter not to use all of his votes as will be the case in an approval 
voting election. 


5. Approval voting. Approval voting is a recently developed (1976) voting pro- 
cedure in which each voter may cast 1 vote for as many or as few of the candidates 
as the voter ‘‘approves of’’; that is, a voter may cast only one vote for a candidate 
but may vote for as many candidates as desired. It is being extensively examined (see 
[4]J-[6], [11]-[14], [17], [19], [20], [22]) because it appears to have such desirable 
properties. Fishburn and Brams [12] show that if a strict Condorcet winner must be 
elected in a one man-one vote election when the voters use admissible strategies, then 
he must also be elected under approval voting if the voters use admissible strategies; 
but the converse is not true. Merrill [17], [19] has contrasted approval voting with 
several other voting procedures in terms of the relative difficulty for the voter to 
approximate his optimal strategies. Fishburn and Brams [13] also investigated ques- 
tions of ‘power’ and “‘efficacy’’, and Hoffman [14] measured ‘“‘cost” and “relative 
efficiency”. The main decision critera C* is independently due to Merrill [19] and 
Hoffman [14] and is given here for the sake of completeness. 

For N candidates, a straightforward argument implies that the voter should always 
vote for C;, never vote for Cy and should vote for C,, 1<i<N, if and only if R; > C7, 
where 


N N 
Ch = ¥ RTs / ¥, Ti 
Vas i¥i 


So in an approval voting election, the voter should vote for C; if and only if R; is 
greater than a certain weighted mean of the ratings of the other candidates. If all of 
the candidates are deemed equally likely to win, then all of the 7, are equal and C7 
reduces to 


which is independent of i. The decision value C is due to Merrill [17] and Weber [22]. 

In the equally likely case, it is clear that the optimal voting strategy given by the 
model is ‘“‘sincere”’ in the sense that if R, >C, then R;2=R;,>C for all i<k. So, if 
the optimal strategy for the voter includes a vote for C,, then it will also include votes 
for all higher rated candidates. In the nonequally likely case, it is shown in [6] that 
the optimal strategy is sincere for N =3. For N 24, the optimal strategy may not be 
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sincere, but it can be shown [14] that the optimal sincere strategy is almost as good 
as the optimal strategy. 

The techniques of § 3 can be used to calculate the values for C. Figure 4 shows 
the C* values for a 3-candidate approval voting election with ratings normalized to 
100 = R, = R2.=R;=0. Simpson’s rule was used over 20 equally spaced points in the 
calculations. If the pre-election poll predicts an outcome of (£1, E2, E3) with S —_ 0.01, 
then the voter should include a vote for C} if and only if the point (E2, R») lies above 
the curve corresponding to the F; value. Thus, if (£1, E2, E3) =(.25, .35, .40), the 
voter should include a vote for C2 if and only if R2>19. As it becomes more likely 
that C; will not win, the voter should be more likely to include a vote for Co, i.e., 
C™ gets smaller. 
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Fic. 4. Critical values for the 3-candidate approval voting election (S* = .01). 


6. Electing a committee. The analysis of single winner elections can be extended 
to several winner elections, i.e., committees. The use of a probability distribution 
defined on the outcome space is the same as in § 3, but the regions of interest are 
different. We first consider the election of a committee of 2 from a slate of 3 candidates. 

The voter again must assign an expected utility to each of the possible outcomes 
(i.e., Committees) in some reasonable way. The voter could (i) assign a value to each 
committee as a unit or (ii) assign a value to each candidate and the sum of the values 
of the numbers to the committee. Assignment system (i) is a more realistic measure, 
but it would be very tedious with more than 3 or 4 candidates. The analysis preceeds 
independently of the rating system in use. 

Denote the three possible committees x = {C,, C2}, y ={C,, C3}, z ={Co, C3}. Let 
R, be the rating assigned to committee k, k =x, y or z. Let A,, be the area of 


Downloaded 12/31/12 to 150.135.135.70. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php 


758 DALE T. HOFFMAN 


indeterminacy between regions x and y, and let 7, be the probability that the election 
outcome falls in region A,,. 7), can be approximated using the techniques of § 3. It 
is assumed that the probability that C; and C; tie is (approximately) the same as the 
probability that C; loses to C; by 1 vote. If the voter does not vote, then his expected 
satisfaction with the election outcome will be 


R.+R R. +R. +R, 
WR, + WyRy + W.R: + Try| ; *) + Trl oA) + T,.(~ a, 


where W, is the probability that committee k is elected. We assume that the committees 
are labelled so that R, 2 R, 2 R;. 
For a one man-one vote election let 


D* = RAT. _ Ta)+R2T,. + Ty3) 

Ty +21, 
If R, > D*, then G(1, 0, 0) > G(0, 1, 0), so a vote for C, is optimal. If Ry <D*, then 
G(1, 0, 0)< G(0, 1, 0), so a vote for C, is optimal. 

An interesting situation commonly occurs in the election of a committee: the 
voter’s favorite candidate is also the consensus favorite and is almost certain to be 
elected to the committee. Should the voter forego voting for his favorite in this case 
and try to get his second choice onto the committee? Both intuitively and according 
to the model the answer is sometimes ‘“‘yes’’: if the favorite has a strong lead and if 
the second favorite has a reasonable chance of defeating the third candidate. Let 
100 = R, 2 R, 2R, =0, and suppose T,, =0.1 and T,, = T,, =0.01. Then the voter 
should vote for C, if Ry <75, i.e., if committee y = {C,, C3} is sufficiently undesirable. 
If C, increases his lead so T,, = 0.12 and T;., = T,, = 0.005, then the voter should vote 
for C2 if Ry <88.5. 

For a one man-N vote election, we will again assume that the total number of 
votes cast is large compared to N (see §4). For 0<a<1, G((1-—@)N, aN, 0)= 
Tyz(R, —R,)\1-a)N+T,,(R, -R,)1—-2a)N + T,,(R,-R,)(a)N. If R, >D*, then 
G(N, 0, 0)2 G((1—a)N, aN, 0)2 G(0, N, 0) for 0<a <1, so the voter should cast all 
N votes for C;. If R, <.D*, then the inequalities are reversed and the voter should 
cast all N votes for C,. The voter should never vote for C3, and it is never uniquely 
optimal to split votes among candidates. In the general M -candidate case, an argument 
similar to that of § 4 shows that it is still never uniquely optimal for a voter to distribute 
his N votes over several candidates. 

Brams [3] has examined the election of several candidates, a committee, by voters 
who vote in blocs. If some of the blocs have sufficient votes to guarantee the election 
of some candidates, he showed that those blocs should spread their votes over an 
optimal number of candidates. The spreading or grouping of votes thus depends on 
which assumption, if either, holds. 

If a voter is only allowed to cast a maximum of N/2 votes for a single candidate, 
the election is equivalent to an approval voting election with 3 candidates. Let 
E* =(R,Txy + R,T,,)/(Tyy + T,,). If R, > E*, then the voter should vote (N/2, 0, 0). If 
R, < EE”, he should vote (N/2, N/2, 0). 


7. A modification. The foregoing analysis tacitly assumed that the voter was the 
only member of the electorate employing the model to determine for whom to vote. 
In most elections, especially those with a large number of voters, this assumption will 
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cause no problems since few persons are likely to be willing and able to go through 
the necessary computations. However, by making the model slightly more complex 
it is possible to eliminate the assumption. But the revised model will require more 
information about the electorate, and the final decision will now be probabilistic. 
Consider an election to elect 2 of 3 candidates and in which each voter has 1 
vote. Suppose that the various blocs of the electorate rate the candidates as in Table 
1. If each voter votes for his favorite, then the outcome will be C,; with 60%, C> with 


TABLE 1 
An example. 


Electoral % of the total Ratings 
group electorate (R,, Ro, R3) 
I 40 (100, 80, 10) 
II 20 (100, 20, 75) 
III 21 (50, 50, 100) 
IV 19 (61, 100, 60) 


19% and C3 with 21%, so the committee will consist of C; and C3. However, if all 
of the voters use the basic model, then the backers of C>, and C3 will still vote for C, 
and C3, since those candidates are in a close race for the second position on the 
committee. But the backers of C), seeing that their candidate is ‘“‘assured”’ of winning, 
will vote for C2 or C3 and attempt to break the virtual tie between those candidates. 
If all of the voters use the basic model, the outcome will be C, with 0 votes, C, with 
59% and C3 with 41%, so C2 and C3 will be elected even though 60% of the voters 
think C\ is the best candidate. 

Since it is unrealistic in a large electorate to suppose that 60% of the voters could 
get together and decide which of them will vote for C; to insure his election and 
which will vote for the other candidates, we resort to probabilistic considerations. A 
backer of C; will with probability gq use the basic model and vote for C2 or C3 and 
will with probability (1—q) vote for C;. In this way C; can be “‘assured”’ of election 
and the ‘‘excess’’ C, votes can be used to decide between C, and C3. This revised 
method will not require collaboration among the C; backers. 

Let F =the voter’s favorite candidate who is also the favorite of the electorate; 
V =the percentage of the electorate for whom F is the favorite; W = the percentage 
of the total vote needed for a candidate to be assured of election (W =1/3 in the 
previous example) and P = the percentage of the backers of F who will use this revised 
model. P is the additional information required for this revised model. Put q= 
min {1, (1/P)—(W/(PV))}. A user of this revised model should: 

(i) use the basic model (not vote for F’) with probability q, 

(ii) vote for F with probability (1—q). 

If each user of the revised model makes the above determination independently, 
then the expected percentage of votes for F' will be (percentage of voters who favor 
F and do not use the revised model)+ (percentage of voters who favor F and use 
the revised model and still vote for F)=V(1—P)+ V(P)(1-—q)= V—-—VPq= 
V — VP((1/P)—(W/(PV))) = W and W is sufficient to insure the election of F. 

In the example of this section, V = 0.6, W = 1/3 and 59% of the electorate think 
the best committee would be C, (or F) and C). If 40% of the backers of F (groups 
I and II) use the revised model, then P = 0.4 and g = min {1, 1.1}=1, so the 40% of 
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the backers of F who use the revised model should (i) not vote for F with probability 
1, and (ii) vote for F with probability 0. Even if P=1, then gq = min {1, 0.44} = 0.44, 
so the user of the revised model should (i) not vote for F with probability 0.44 and 
(ii) vote for F with probability 0.56. 

The revised model calls for a deviation from the basic model only if a relatively 
large proportion of the backers of the favorite are expected to use the model. It seems 
unlikely that such a deviation will be necessary unless the electorate is small or 
unusually sophisticated. 


8. Extensions and further work. The model extends naturally to the election of 
a single candidate or committee from a slate of N candidates—all that is needed is a 
probability distribution defined on the (N —1)-dimensional outcome space and an 
expected satisfaction rating assigned to each of the possible outcomes. The ideas and 
techniques also extend to the analysis of multistage elections. Suppose that the election 
rules in a 3-candidate single winner election require a runoff election if no candidate 
receives a majority of the votes on the first ballot. The outcome regions for these 
rules are shown in Fig. 5. The voter must now assign utility ratings to 6 possible 
outcomes—3 first-ballot win regions and 3 runoff regions. Once this assignment is 
made and once a probability distribution is imposed on the outcome space, the 
probabilities that the outcome is close to a boundary can be approximated. These 
multistage election models quickly become complicated but should yield reasonable 
results. 


FIG. 5. Outcome space for a 3-candidate 1-winner election with runoff provision. 


An exact personal rating of utility may be difficult to assign, but the truly concerned 
voter should be able to assign ratings in some reasonable fashion. Then the resulting 
recommendations can be viewed qualitatively rather than quantitatively. 

The use of pre-election polls to determine how to vote is certainly open to criticism 
and possible abuse. A numerical analysis of the effects of changes in the poll results 
on the various conditions would be appropriate. But any voting decision model which 
expects to maximize a voter’s effect must somehow take into account the views of the 
rest of the electorate. 
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